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Abstract. The most successful parallel SAT and MaxSAT solvers fol¬ 
low a portfolio approach, where each thread applies a different algorithm 
(or the same algorithm conhgured differently) to solve a given problem 
instance. The main goal of building a portfolio is to diversify the search 
process being carried out by each thread. As soon as one thread fin¬ 
ishes, the instance can be deemed solved. In this paper we present a new 
open source distributed solver for MaxSAT solving that addresses two 
issues commonly found in multicore parallel solvers, namely memory con¬ 
tention and scalability. Preliminary results show that our non-portfolio 
distributed MaxSAT solver outperforms its sequential version and is able 
to solve more instances as the number of processes increases. 


1 Context and Motivation 

The Maximum Satisfiability (MaxSAT) problem can be defined as an optimiza¬ 
tion version of the Propositional Satisfiability (SAT) problem. Given an (usually 
unsatisfiable) CNF formula (j), the MaxSAT problem can be defined as finding 
an assignment to problem variables such that it minimizes the number of unsat¬ 
isfied clauses in (j). In partial MaxSAT, given a CNF formula (j> = (j)s ^ 4 >h, the 
goal is to find an assignment such that it minimizes the number of unsatisfied 
soft clauses in (ps while satisfying all hard clauses in pu- Although there are 
weighted variants of MaxSAT [7], in this paper we focus on partial MaxSAT. 

Let pH denote the relaxation of a partial MaxSAT formula p. In pn we 
associate a fresh relaxation variable rj with each soft clause ujj in ps such that 
pR = pH U {{uJj y Tj) : ojj € ps}- Notice that finding an assignment to the 
variables in pn such that it minimizes the number of relaxation variables assigned 
value 1 while satisfying all clauses is equivalent to solving the partial MaxSAT 
formula p. Hence, a common approach for solving partial MaxSAT is to relax p 
and iteratively call a SAT solver on pH with an additional constraint ^ T,<b 
encoded into CNF |3|. Initially, we can define b = |0s|. If a solution is found such 
that ^ relaxation variables are assigned value 1, then b is updated to /i — 1. The 
algorithm ends when the formula becomes unsatisfiable and the optimal solution 
is the last one found with value /r. 

The described algorithm performs a linear search on the number of satisfiable 
soft clauses. More recently, several algorithms have been proposed that take ad¬ 
vantage of the ability of SAT solvers to provide an unsatisfiable sub-formula in!- 


These unsatisfiability-based algorithms have been shown very successful for solv¬ 
ing industrial instances. We refer the reader to the literature for details m- 

Due to the predominance of multicore architectures instead of higher fre¬ 
quency CPUs, recent work on MaxSAT solving has been deviating to the design 
of parallel solvers. The most successful parallel algorithms implement a portfolio 
of sequential solvers. The portfolio may include both different algorithms or the 
same algorithm with different configurations. In parallel solutions, all the cores 
of a typical computer access primary memory through the same BUS, and no 
two cores can use the BUS at the same time. Therefore, as the number of cores 
increases, so does contention on the access to primary memory, hindering the 
scalability of parallel algorithms. 

Another alternative to sequential algorithms are distributed algorithms. On 
one hand, these algorithms do not suffer from the drawback mentioned above. 
On the other hand, distributed algorithms are naturally designed to surpass 
the limitations on diversification of portfolio-based approaches. This paper de¬ 
scribes DistMS, a distributed MaxSAT solver that implements two distributed 
algorithms. The first algorithm splits the search space by assigning different up¬ 
per bound values of the optimum solution to different processes. The second 
algorithm is based on choosing a subset of the problem’s variables and dividing 
the possible combinations of values for those variables among the processes. 

2 Distributed MaxSAT Solver DistMS 

In this section we present the distributed algorithms for MaxSAT implemented 
in the DistMS solver. The first algorithm is an adaptation of the parallel Search 
Space Splitting approach, first proposed by Martins et al. unci]. The second 
algorithm is based on the guiding paths splitting strategy mm, which has 
been shown to be successful in parallel and distributed SAT solvers. 

The architecture of DistMS is composed of one master and multiple slave 
processes. The master process mediates the whole communication between the 
slave processes, being responsible for assigning tasks to slaves and handling com¬ 
munication. On the other hand, slaves wait for a task to be given by the master, 
process that task and send the result back to the master. 

Unlike other distributed solvers [5|, slave processes in DistMS do not com¬ 
municate directly. All communication is established between the master and the 
slave processes. The main goal is to minimize the changes in the slave behaviour 
such that any MaxSAT algorithm can be easily incorporated into the DistMS 
solver. Although one might think that this would be a bottleneck in the mas¬ 
ter process, such behaviour has not been observed, even when the number of 
processes increases to several tens of slaves. 


2.1 Search Space Splitting Algorithm 

Given n processes, the Search Space Splitting algorithm is composed by 1 master, 
1 unsatisfiability-based process and n — 2 linear search processes. The master 






process is responsible for keeping a lower bound A and an upper bound /r on 
the optimum solution of the MaxSAT instance. The lower bound is initially 0. 
The master process starts with a SAT call containing only the hard clauses (pjj. 
Note that if this SAT call is unsatisfiable, then the solver terminates immedi¬ 
ately, returning that the instance is not satisfiable. Otherwise, the number of 
unsatisfiable soft clauses provides an initial upper bound fj,. 

The main goal of the search splitting algorithm is to split the set of possible 
values for the optimal solution. Given k linear search slave processes, pi,... ,pfc, 
the interval defined by the lower bound A and upper bound /r is split across 
the k processes. The initial bounds set is {6o, &i, fefc-i, ^fe} with bo = X and 
bk = fJ- — ^ and with each process pi being responsible for checking if a given 
tentative bound bi is either a lower or upper bound on the optimal solution. 

A slave process pi executes a SAT call on a relaxed MaxSAT formula t/fjj with 
the additional constraint ^ rj < bi encoded into CNF m- If the formula given 
to process pi is unsatisfiable, then bi is a lower bound and A can be updated. 
Otherwise, bi is an upper bound and p, can be updated with the number of 
relaxation variables Vj assigned value 1. 

Initially, the value of bi for process pi is given by \i x since we have 

A = 0. The master process maintains a sorted set B of bounds to be checked by 
the slaves. 

Example 21 Let p = S7 be the initial upper bound. Given fc = 6 linear search 
processes, the initial bounds set is B = {0,6,12,18,24,30,36}. 

If a given slave process finds a new lower bound, then A is updated and all 
values smaller than A are removed from the bound set B. Otherwise, if a slave 
process finds a new upper bound, then p is updated, all values larger than p are 
removed and p is added to B. 

Let B — {6o, 6i,..., 6fc_i, be the current bounds set. If a given slave process 
Pi needs a new bound to search on, then the master chooses a pair [bm-i,bm) 
of contiguous values such that bm — bm-i > bj — bj-i for all 1 < j < A:. A new 
tentative bound bi = is computed and bi is added to B. Furthermore, 

the new bound is sent to slave process pi as a tentative bound. 

Example 22 Let B = {5,12,22,27,40} be a bounds set. Suppose that pi finds 
that 26 is an upper bound. In this case, B is updated to B = {5,12,17, 22, 25} 
where 17 is the new tentative bound for slave process pi. Next, if pi finds that 
17 is a lower bound, then the bounds set is updated to B = {18, 20, 22, 25} where 
20 is Pi’s new tentative bound. 

Additionally to the slave processes that check tentative bounds, DistMS also 
includes a slave process executing an unsatisfiability-based algorithm. This is 
mainly to be able to quickly update the lower bound in few iterations, thus 
constraining the tentative bounds to be provided to other slave processes by the 
master. 

An optimal solution is found when the lower bound A is equal to the upper 
bound p. When this occurs, the master process aborts the execution of the 
remaining processes and terminates, returning p as the optimum value. 



2.2 Guiding Paths with Lookahead Algorithm 


Heule et al. [5] already proposed a parallel SAT algorithm that initially uses 
a lookahead solver to generate guiding paths in order to split the search tree. 
Lookahead solvers apply sophisticated reasoning at each branching step in order 
to guide the search more effectively. The algorithm described throughout the rest 
of this section is an extension of the previous approach to distributed MaxSAT. 

Given n processes, the guiding path algorithm in DistMS is composed by 1 
master, n — 2 guiding path solver processes and 1 linear search process. The 
master starts by generating a queue of guiding paths to be solved by the slave 
processes while waiting for an initial upper bound /i from the linear search 
process. The guiding paths are heuristically sorted and given to available slave 
processes with the best upper bound computed thus far. Each slave applies a 
linear search MaxSAT algorithm [T] and returns the best solution found for the 
given path to the master. If the newly found solution improves on the previous 
one, it is saved and the upper bound fi is updated. 

Note that unlike other guiding path solver architectures, the number of ini¬ 
tial guiding paths is usually much larger than the number of slaves. Hence, when 
a given guiding path is solved, the master immediately removes the first guid¬ 
ing path from the queue and sends it to the slave. The MaxSAT instance is 
considered solved when the guiding path queue becomes empty. 

When the master sends a guiding path to a slave process, it also provides 
the current upper bound The working formula on the slave process contains a 
relaxation of the MaxSAT formula (pn and a cardinality constraint X) G — ^ ~ ^ 
encoded into CNF. The guiding path literals are considered assumptions in the 
SAT solver calls occurring in the slave process. Therefore, if the working formula 
is unsatisfiable, the slave process is able to provide a reason for the unsatisfiability 
of the formula to the master. When the unsatisfiability does not depend on the 
guiding path, one can conclude that the working formula is not satisfiable due to 
the cardinality constraint and /r is a lower bound of the MaxSAT formula. As a 
result, the previously found solution fj, is optimal and the solver can terminate, 
even if there are guiding paths in the queue. 

The pseudo-code for the guiding path generation procedure is presented in 
algorithm This procedure receives as input a CNF formula p, the set C of 
guiding paths computed so far by the procedure, the current partial assignment 
D, the set I of literals that are implied by the partial assignment D and a cutoff 
value 9. Formula </> corresponds to the hard clauses of the MaxSAT instance. 

The algorithm starts by incrementing the cutoff value 0 (line 2). This is done 
to prevent 9 from being reduced too much by the decrement rule in line 4 and as 
a consequence generating too small guiding paths. The initial cutoff value is 1000 
as specified by Heule et al. [ 3 . In practice, 9 is incremented by 5%. Next, unit 
propagation is applied to simplify p and update set I (line 3). The algorithm 
then checks if p is unsatisfied by the current assignments (line 4). In this case, 9 
is decremented (line 5). 9 is also decremented if |I1| -l-log 2 \4>\> 2b (line 4). This 
rule prevents the guiding path generation process of going too deep in the search 


Algorithm 1: Guiding Path generation algorithm 

1 Procedure GenerateGuidingPaths (i^, C, D, I, 6) 

2 1 IncrementCutoff(0) 

3 I) ^ Propagate (0, D, I) 

4 <p is unsatisfied by D U I or |D| + log 2 \ 4>\ >25 then 

5 I DecrementCutoff(0) 

6 end 

7 if is unsatisfied by DU I then 

8 AnalyzeAndLearn((/), D, /) 

9 return C 

10 end 

11 if |_D| X |_D U 7| > 0 X I Vars((^) | then 

12 I return C U {D} 

13 end 

14 re- ChooseVariable((/), D, I) 

15 Z e- ChoosePolarity (</), x) 

16 G GenerateGuidingPaths ((/), C, DU{1}, I, 9) 

17 return GenerateGuidingPaths (i^, C, D VJ {-'1}, I, 9) 


tree and generating too many guiding paths. In practice, 6 is decremented by 
30%. 

If 4> is unsatisfied, then the procedure applies conflict analysis and 

learns a new clause (line 8), similarly to a CDCL SAT solver [S]. This may 
prevent the procedure from generating guiding paths that unsatisfy </>. If (j) is 
not unsatisfied, then the algorithm checks if the cutoff has been triggered (line 
11). If so, I? is returned as a guiding path. The cutoff condition takes into account 
the number of branching steps and the total number of assignments, explicit and 
implied, in the current node of the search tree. 

If the cutoff is not triggered, then an unassigned variable x is chosen heuris- 
tically to be added to D (line 14). Given a variable x, we denote as evalcis{x) 
(evalcisi^x)) the sum of the weights of the clauses that are reduced by the as¬ 
signment a; = 1 (a; = 0) but are not satisfied. The clauses are weighted in a way 
such that a clause with length k has a weight five times larger than a clause 
with length fc -|- 1. Variables are ranked by evalcia{x) x evalcis{~'x) and ties are 
broken by evalcis{x) + evalcia{~'x). 

Example 23 Let (() = {(a;i V a;2 Va;3), (a;2 V-13:3), (-ia:i Va;2)} be a CNF formula. 
Hence, evalcia{~^x^) = 6 (clauses with length 2 and 3 have weights 5 and 1, 
respectively, if S is the maximum clause size). 

In practice, a variation of the evalda heuristic, referred to as evalyji, is used to 
rank variables. Given a literal I, the only difference is that instead of considering 
all the clauses in cf, only the clauses watching m literal I are considered in the 
computation of evalwi{l)- Also, only variables in soft clauses are considered when 
choosing a new variable. 









Table 1. Experimental evaluation of DistMS 


Instance Group 

Total 

MSU3 

LinearSU 

GP-2:4 

GP-2:8 

SSS-2:4 

SSS-2:8 

crafted 

377 

260 

283 

290 

294 

279 

278 

industrial 

627 

551 

524 

518 

515 

562 

564 


After choosing a variable x, another heuristic is used to decide which truth 
value will be tested first (line 15). We choose the direction based on the number 
of clauses that will be unsatisfied after assigning x. The branch to be explored 
first is the one that unsatisfies a smaller number of soft clauses. Ties are broken 
choosing the direction that satisfies more soft clauses. The rationale for this is 
that the branch that unsatisfies less soft clauses is more likely to reach an upper 
bound closer to the optimum value. Algorithm is then repeated for x and —'X 
(lines 16 and 17). 

Note that, since the master sorts guiding paths as they are generated, in the 
long run the sorting heuristic dominates the polarity heuristic. The priority is 
given to the least restricting guiding path and ties are broken by choosing the 
one that was generated first. 

If there are idle processes and there are no guiding paths left in the queue, 
then one of the paths currently being solved is chosen to be further split into 
new guiding paths. We choose the path g that was assigned first. Algorithm 
[2 is re-invoked, but this time with D = g and I updated accordingly. When 
algorithm is re-invoked with D ^ 0 \s initialized as 5000, or else too few 

guiding paths would be generated. 


3 Experimental Results and Discussion 

The results in Table were obtained on the partial MaxSAT crafted and in¬ 
dustrial instances of the MaxSAT evaluation of 2013. DistMS was implemented 
on top of OpenWBO m and different configurations of DistMS are compared 
against the sequential counterparts. MSU3 refers to the OpenWBO’s unsatisfiability- 
based MSU3 algorithm, LinearSU to the OpenWBO’s linear search algorithm, 

GP -n:m to guiding paths with n processes per each one of the m machines 
and SSS-n:m to search space splitting with n processes per each one of the m 
machines. For each instance, algorithms were executed with a timeout of 1800 
seconds (wall clock time) and a memory limit of 4 GB per process. The tests 
were conducted on a cluster of machines with 4 AMD Opteron 6376 (2.3 GHz) 
and 128 GB of RAM, running Debian Jessie. 

Experimental results show that search space splitting (SSS) performs better 
in industrial instances, while using guiding paths (GP) allows DistMS to perform 
better in crafted instances. SSS solves more instances than the sequential solver 
and slightly increases its performance with a growing number of processes. How¬ 
ever, gains are small, since the solver quickly converges to near the optimum 
bound, and then the diversification of the search is small. 











The GP approach fails to perform in industrial instances. Unlike the SSS, in 
the GP approach it is hard to converge to the optimum in industrial instances. 
Nevertheless, observe that the GP approach is the best performing in crafted 
instances and it continues to improve as the number of processes grows. 

This paper proposes the first distributed MaxSAT solver. Although previ¬ 
ous multicore parallel approaches have been proposed, they fail to scale when 
the number of threads increases beyond 8 threads, since these are based in a 
portfolio of sequential solvers. As a result, DistMS integrates two non-portfolio 
strategies, namely search space splitting on the number of unsatisfied soft clauses 
and generation of guiding paths. Experimental results show that we are able to 
improve on the sequential solvers, but the scalability is still unclear. Given the 
mixed results from both approaches in different sets of instances, as future work 
we propose to integrate them into a unifying framework for distributed MaxSAT 
solving. 
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